Prediction of extreme events in the OFC model on a small world network 
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We investigate the predictability of extreme events in a dissipative Olami-Feder-Christensen model on a small 
world topology. Due to the mechanism of self-organized criticality, it is impossible to predict the magnitude 
of the next event knowing previous ones, if the system has an infinite size. However, by exploiting the finite 
size effects, we show that probabilistic predictions of the occurrence of extreme events in the next time step are 
possible in a finite system. In particular, the finiteness of the system unavoidably leads to repulsive temporal 
correlations of extreme events. The predictability of those is higher for larger magnitudes and for larger complex 
network sizes. Finally, we show that our prediction analysis is also robust by remarkably reducing the accessible 
number of events used to construct the optimal predictor. 



Introduction.- Self-organized criticality (SOC) is a mech- 
anism by which a large class of spatially extended dynamical 
systems, starting in a non-equilibrium uncorrected state, can 
spontaneously organize into a dynamical critical state with a 
high degree of correlations ^ . In particular, a partial synchro- 
nization of the elements of the system is usually responsible 
for building up long range spatial correlations and thereby cre- 
ating a critical state, in the thermodynamic limit. The emer- 
gence of this SOC complex behaviour in physical systems is 
manifested, for instance, by temporal and spatial scale invari- 
ance, i.e. power law or scale free behaviour. SOC has been 
proposed as a way to model the widespread occurrence of 
power laws in nature, i.e. the abundance of long-range corre- 
lations in space and time - similar to those observed in critical 
phase transitions |01 - of completely different events, e.g. lu- 
minosity of quasars, chemical reactions, evolution, sand-pile 
models, earthquakes, avalanches, forest burns, heart attacks, 
market crushes, etc. ([llSHl- Actually, the realistic appli- 
cability of SOC models to describe some of these real events 
is still debated - see, for instance, Refs. ifH [3 U10I1 . An im- 
portant consequence of the presence of SOC scenario is that 
in the critical state the evolution of the system is completely 
unpredictable. Indeed, once a SOC system reaches the critical 
state, arbitrarily large events can be generated intrinsically by 
the dynamics itself. Another important feature of the SOC be- 
haviour is that no fine-tuning of some external control param- 
eters is required. For a generic initial condition, after some 
time interval to let the system 'synchronize', it drives itself 
into a critical state. However, in the real world the real systems 
have a finite size and this unavoidably induces the presence of 
some extra correlations between following events. Such cor- 
relations will be used in the following as a useful source of 
information on the system evolution and they will allow us to 
forecast particular events. Actually, we will show that there 
exists some nontrivial predictability for large but finite system 
sizes and for the so-called extreme events. The latter can be 
defined as large deviations from the 'average' behaviour of a 
complex system and are normally caused by the presence of 
intrinsic dynamical fluctuations. These phenomena and their 
statistical properties have been intensively studied in literature 
inlljjt] but, however, the mechanisms and dynamics underly- 
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FIG. 1. Construction of a small world topology introducing a small 
fraction 2% of the total number of links) of long-range links in a 
20x20 square lattice. It corresponds to a rewiring probability equal 

iOPrew = 0.02. 



ing these huge deviations are not always fully clarified. More- 
over, the extreme events play an important role in nature and 
in our daily life because they are often associated to destruc- 
tive events, e.g. hurricanes, strong earthquakes, etc. In this re- 
spect, the predictability of extreme events is urgently desired 
but also intensely debated 11 1311 . In this paper, following the 
prediction analysis in sand-pile model in Ref. lll4ll . we will 
investigate the predictability of such extreme events, by ex- 
ploiting the time correlations induced by the finiteness of the 
system, in a SOC model on a complex network recently stud- 
ied in Refs. lll5Ul6ll . In other words, on the basis of past ob- 
served events, it is possible to predict when the next extreme 
event will happen and the success probability will increase 
for 'more' extreme events and for larger systems. The lat- 
ter seems to be in contradiction to our claim that predictabil- 
ity relies on finite size effects, but it is true if the magnitude 
of avalanches is measured relative to the size of the largest 
possible avalanche, so that, when comparing predictability in 
systems of different size, we also compare events of different 
absolute size. 

Model- Here, we analyze one of the most interesting 
SOC models, i.e. the Olami-Feder-Christensen (OFC) model 
Let us consider a two-dimensional (L x L) square lattice 
of N = sites, in which each of them carries a real variable 



Fi (initially a random value in [0,^4^^]), representing for in- 
stance a seismogenic force, and is connected by a link to its 4 
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nearest neighbours. Then, all these variables Fi are uniformly 
and simultaneously increased in order to describe a loading or 
stress accumulation (e.g., uniform tectonic loading). At some 
point, when one site becomes unstable, i.e. Fi > Fth with 
Fth being some threshold value, the driving is stopped (no 
extra time step) and a domino effect happens (discharging or 
stress release), i.e. an earthquake or avalanche starts. In other 
terms, one has: 



F., > Fth => 



^ nr. 







Fnn + aF, 



(1) 



with nn being the set of nearest-neighbor sites of i. The mag- 
nitude of the domino event will be so defined by the number 
of these topplings at time step i, i.e. Si, while a describes the 
presence of dissipation in the model. In the case of a square 
lattice, the dynamics is conservative for a = 0.25, while it is 
dissipative for a < 0.25. Here, we will consider a more real- 
istic version of this model, studied recently in Ref. Il5lll6ll . in 
which a small fraction of long-range links was introduced to 
obtain a small world topology [jTQ. In particular, the presence 
of a few long-range edges is enough to synchronize the system 
and both finite-size scaling and universal exponents appear 
1 1611 . The use of a small- world topology is justified also by the 
fact that long-range spatial correlations have been observed in 
nature, for instance in earthquake triggering and interaction, 
where the static stress may involve relaxation processes in the 
asthenosghere with relevant spatial and temporal long-range 
effects i 1 8l - l28ll . The construction of the small world network 
from a square lattice of size L is explained in more details 
m Refs. 11 m. An example of the small world topology 
construction is qualitatively shown in Fig. ([T]). Basically, the 
links of the lattice are rewired at random with a probability 
Prew and small world transition and self-organized critical- 
ity (with finite size scaling with universal critical exponents 
for the avalanche size probability distribution) are observed at 
Prew = 0.02, in the dissipative regime, i.e. a = 0.21 ll29ll . 
Notice that, in the thermodynamical limit, the size probability 
distribution is a power-law (without cutoff) and any avalanche 
size is possible, so both the largest and the mean avalanche 
size are infinite. However, in a finite size system, i.e. for a 
finite L, this probability distribution is a power law with a cut- 
off, induced by the finite extension of the lattice, and we will 
indicate with Smax the largest avalanche size. This quantity 
will be numerically determined in the following. For a generic 
initial condition, the OFC model on the small world topology, 
after a transient (discarded later) to build up spatial long-range 
correlations, reaches a critical state and generate a time series 
of avalanche size {si}, i — 1, . . . ,ti. In particular, we will 
analyze a time series of n = 10^ events. Moreover, we define 
the recurrence time of avalanches of size larger or equal than s 
as the time interval ( j — i) in between two consecutive events 
Si and Sj such that Si > s and sj > s, with j > i. For infinite 
lattice size (thermodynamical limit), the recurrence time dis- 
tribution is an exponential for any s since successive events 
are always uncorrected to each other. However, as pointed 
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FIG. 2. Conditional probability P{X — l)\y) as a function of the 
decision variable y, for events with size exceeding 1%, 2%, 5%, 
10%, 15%, 20%, 25%, 30%, and 35% of the maximal avalanche 
size Smax, in the case of L = 128. 



out above, any real system and any numerical simulation in- 
volves a finite size and this thermodynamical limit behaviour 
does hold only for small avalanches (s <C L^), which occur in 
the bulk of the lattice and are not affected by the presence of 
boundaries. Indeed, the recurrence time distribution of large 
avalanches will be not exponential but actually one observes 
the suppression of short time intervals. The reason of this be- 
haviour is the following. After a very large avalanche, the sys- 
tem may relax into a sub-critical state and several time steps 
are necessary to drive it again into a critical regime. This ef- 
fect in turns implies some sort of repulsion between 'extreme' 
avalanches. These repulsive correlations are quite weak but 
useful enough to make predictions on the future occurrence of 
extreme events. 

Prediction algorithm- Following Ref. 11411 . we introduce 



a Boolean series associated to the events with a size larger 
than some threshold rj, i.e. {Xi} with i = 1, . . . , n such that 
Xi ~ 1 if Si > rj and Xi ~ otherwise. The idea is that, by 
using information from the past, e.g. from one half of the time 
series (i = 1, . . . , n/2), we want to predict if the next event 
variable Xj (with j = n/2 + 1, . . . , n) is 1 or 0, i.e. whether 
the next event will exceed the size 77. In order to do that, we 
define a decision variable series {?/;} as follows 



< a < 1, 



(2) 



with a being a time scale tuning the quaUty of the prediction. 
Here, for any threshold 77, as in Ref. 1I14I1 . a will be chosen 
of the form a = cxp{-l/r(L)} with T{L = 16) = 56, 
T{L = 32) = 112, T{L = 64) = 225, and T{L = 128) = 
500, which are numerically optimized. Notice that in the case, 
for instance, of L = 128, a is such that the lOOOth event 
before the time i, i.e. k = 1000 in Eq. ([2]), has a weight 
1% smaller than the one used for much smaller k. In other 
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words, the time scale of the repulsive correlations, discussed 
above, can be at most considered of the order of thousands 
events. Besides, let us stress that yi does depend only on the 
past events and so from the knowledge of the past time se- 
ries one wants to forecast the future events. As discussed in 
Ref. iffl . the possibility of making predictions is subordi- 
nated to the fact that the conditional probability P{X = l\y), 
i.e. the probability for an avalanche with size larger than rj to 
occur given a specific value of the decision variable y, must 
show at least a maximum as a function of y. This necessary 
condition for predictability is investigated in our case for sev- 
eral values of ?/ in terms of s^ax, as shown in Fig. (|2]i. We 
find that for small sizes, i.e. 77 equal to a few % of Smax, 
the conditional probability P{X = l\y) has almost a flat be- 
haviour, implying that the probability for an event to happen 
is completely independent on the past, i.e. no predictability 
is possible. In this case, the finite-size effects are irrelevant 
and the behaviour of small avalanches well approximates the 
temporal features of the thermodynamical limit. However, for 
large avalanche sizes P{X = 1 |y) shows a maximum at small 
values of y, i.e. it is more likely to have a large event when 
only small avalanches happened in the recent past. It is an- 
other way to show that extreme events repel each other and 
between two consecutive ones the system is in a sub-critical 
state and generates mainly small avalanches. Let us point out 
also that the maximum of P{X = l\y) is more pronounced 
for larger 77, i.e. larger events are more predictable. Now, we 
will describe in a more quantitative way how to make predic- 
tions once one knows the decision variable y. In particular, 
one needs to define a probabilistic predictor on {yi}, defined 
as a map y^ H> pi, with pi e [0, 1]. It can be proved that 
the optimal predictor is simply characterized by the condi- 
tional probability, i.e. p, = P{X = l\yi) HIM Ell. In 
general, this predictor is probabilistic, since it is defined by 
P{X = but it can converted into a deterministic one by 
simply introducing a threshold, as follows 



X, 



pred 



if P{X = 
otherwise. 



(3) 



with Taiarm being a control parameter determining the total 
alarm rate, while Xf^"^"^ is the predicted value of the Boolean 
variable Xi. By increasing the value of r alarm , the total alarm 
rate decreases, i.e. one predicts one event only when the oc- 
currence probability is really high. Of course, a posteriori 
we may check whether our prediction is right or false by just 
comparing the variables Xi and xf^'^'^. On one hand, when 
Xi = j^pred _ jj^g prediction is correct and this event is 
counted in the so-called hit rate. On the other hand, when 
Xi ^ j^P'^erf _ 2^ the prediction is wrong and it does corre- 
spond to a false alarm. Notice also that, if r alarm = 0, the 
condition in (O is always satisfied and both rates tend asymp- 
totically to 1. Instead, for r alarm close to the maximum of 
P{X = l\yi), that inequality does never hold and both rates 
are zero, i.e. one does not make any prediction. 

Specifically, we will use the first half of our avalanche time 
series of 10® events as a training set to construct the predictor. 
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FIG. 3. ROC plots for the prediction of extreme events, for events 
with size exceeding 1%, 2%, 5%, 10%, 15%, 20%, 25%, 30%, and 
35% of the maximal avalanche size Smax (from bottom to top), in the 
case of L = 128. These curves are squeezed to the colored (blue) 
area in the case of L = 64, i.e. the predictabihty is better for larger 
system sizes. This behaviour does hold also for other smaller lattice 
sizes, that we have considered, e.g. L = 16 and L = 32. Inset: Hit 
rate and false alarm rate versus the threshold Taiarm (normalized to 
max[P(X = l|y)]), for 77 = 10%, 35% s^a.. 



which means that we estimate P{X = 1 1 jy^) on these data. We 
then feed this P{X = l|y;) with yi values from the second 
half and compare the outcome to the corresponding boolean 
values Xi. 

Prediction quality and results- Here, we analyze in more 
detail the quality of the y-based predictions in the case of the 
OFC model on a small world network, described above. When 
we use the prediction algorithm, there are two possible er- 
1) missing an event, i.e. Xi — 1 and 
X, = and Xf""'' = 1. 
A possible method to characterize the prediction quality is the 
Receiver Operating Characteristics (ROC) ll32ll . which was re- 
cently used in Ref. 11411 for a prediction analysis in a sand- 
pile model. It consists of comparing (in ROC plot) the hit 
rate Vhit and the false alarm rate r false, as a function of the 
threshold r alarm- The benchmark is the case Vhu = r false, 
i.e. the diagonal in the ROC plot. This is the outcome if 



rors (X, ^ Xr'') 
j^pred _ jj-j false alarm, i.e 



predictions x^^'^ ~ 1 are made at random times, indepen- 
dent of the values of yi. When Vhu > r false, the predictor 
is useful and the distance of the ROC curve from the bench- 
mark diagonal is a good indicator of the quality of the pre- 
diction. The ROC curves for the model above in the case of 
L = 128 are shown in Fig. Q for different thresholds ?/ in 
terms of the maximum avalanche size Smax- Note that, in- 
creasing the value of Vaiarm, one can get a very large hit rate 
with a small false alarm rate (i.e., very good prediction). The 
extreme points of each ROC curve corresponds to the triv- 
ial situations: a) r alarm = 0, with rh^t = r false = 1, b) 

Taiarm = maxP(X = l\yi) with rhit = r false = 0. Let 

us stress that the predictability is better for larger threshold 
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avalanche sizes. In the inset of Fig. [3] we show the behaviour 
of the hit rate and the false alarm rate as a function of r alarm 
normalized to the maximum value of the conditional proba- 
bility P{X = When lowering the threshold r alarm, the 
sensitivity increases, and for large avalanches (77 = 35%), the 
hit rate starts to rise much earlier than the false alarm rate, 
indicating the predictive skill. 

We have applied this analysis also to smaller system sizes, 
e.g. L = 16, 32, 64. There are two essential results: 
For given system size, we always detect some nontrivial pre- 
dictability of large events, and the predictability is the better 
the larger the events we intend to predict. This is an evident 
consequence of the fact that these are more strongly affected 
by the finiteness of the network. When comparing different 
system sizes and predicting events of identical magnitude, the 
smaller system is better predictable. In this respect, the pre- 
dictability asymptotically vanishes for infinite system size, as 
expected. If, however, we measure event magnitudes as per- 
cent of Smax, then larger systems are better predictable. For 
instance, for L = 64, all ROC curves are squeezed to the blue 
area in Fig. (O and the prediction quality is lower that in the 
case of L = 128. Finally, we consider how the predictabiUty 
quality changes by varying the total number of events in the 
time series, i.e. n. Indeed, a so large number of events, like 
10*, could be not accessible for real phenomena. However, 
in Fig. m we show the optimal prediction quality, measured 
as the largest distance of the ROC curve from the benchmark 
(Xhit = f false), and the corresponding hit rate and false alarm 
rate as a function of the total number of events n. We find 
that the predictability is similar even constructing the decision 
variable on a number of events which is three orders of mag- 
nitude smaller than the one used above. In other words, this 
prediction analysis seems to be also quite robust with respect 
to the size of the accessible sample. 

Conclusions and Outlook- We have investigated the time 
series of extreme events generated by a dissipative Olami- 
Feder-Christensen model on a small world network. The 
small world property is here essential to create vary large 
events - without small world, the system is not truly critical 
1 1511 . The presence of finite-size effects induces repulsive time 
correlations between consecutive extreme events and this can 
be used to make predictions. In particular, we have consid- 
ered a decision variable which keeps record only of the re- 
cently past events and, by using the conditional probability as 
optimal predictor, we have shown that the predictability qual- 
ity is really good for large avalanche sizes and for larger net- 
works. In this respect, we have applied a ROC analysis and 
found that the ratio hit rate/false alarm rate can be remark- 
ably high when considering 'more extreme' events. There- 
fore, these results show that, although the SOC models can 
be applied to describe very well real events (implying also 
that they are uncorrected in time), however, the fact that real 
physical systems and practical models have a finite size can be 
exploited to extract information from the extra temporal corre- 
lations, induced by the finite-size effects, and forecast the oc- 
currence of extreme events. Interestingly enough, the quality 
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FIG. 4. Optimal prediction quality, measured as the largest distance 
of the ROC curve from the benchmark {rhu = r false), versus n, in 
the case of L = 128 and 77 = 35% Smax- Inset: Optimal hit and false 
alarm rates versus n. The predictability is not affected by remarkably 
reducing the number of events used to define the decision variable. 



of the predictability is higher for larger systems and for more 
'catastrophic' events, whose predictability is, of course, even 
more urgently desired. Finally, although maybe most real phe- 
nomena may not share the properties of the SOC model ana- 
lyzed above, our results suggest to exploit the presence of, 
though weak, temporal correlations (not necessarily coming 
from finite-size effects, but also from other sources), to try to 
forecast extreme events. 
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